Audio Classification and Retrieval by Using Vector Quantization
نویسندگان
چکیده
In today’s world, we can say that information and its processing has become the critical aspect for functioning of everything. In the early days, information was generally obtained and processed in the form of text. Today information is available in all forms namely, text, music, graphics, etc. which are a easily understandable and accurately represent information. Information is first captured then the captured information is retrieved and analyzed for further requirements. In this paper, the information that we take into consideration is in audio form. We have studied the feature vector extraction methods, similarity measurement techniques, and have also measured the performance parameters. It has been observed that the use of multiple feature vectors provides better and more accurate classification and retrieval of audios from large database.
منابع مشابه
An Efficient Tree-based Quantization for Content Based Music Retrieval System
In this paper, we have proposed and implemented a new music retrieval system based on content of music wave files. We have investigated different quantization methods by constructing them into the music data histograms as the feature vectors for the music files. There are three important aspects that will affect implementation of the system: audio feature extraction, quantization and distance c...
متن کاملA Similarity Measure for Automatic Audio Classification
This paper presents recent results using statistics generated by a MMl-supervised vector quantizer as a measure of audio similarity. Such a measure has proved successful for talker identification, and the extension from speech to general audio, such as music, is straightforward. A classifier that distinguishes speech from music and non-vocal sounds is presented, as well as experimental results ...
متن کاملContent-based methods for the management of digital music
The literature on content-based music retrieval has largely finessed acoustic issues by using MIDI format music. This paper however considers content-based classification and retrieval of a typical (MPEG layer III) digital music archive. Two statistical techniques are investigated and appraised. Gaussian Mixture Modelling performs well with an accuracy of 92% on a music classification task. A T...
متن کاملA high speed unsupervised speaker retrieval using vector quantization and second-order statistics
This paper describes an effective unsupervised method for query-by-example speaker retrieval. We suppose that only one speaker is in each audio file or in audio segment. The audio data are modeled using a common universal codebook. The codebook is based on bag-of-frames (BOF). The features corresponding to the audio frames are extracted from all audio files. These features are grouped into clus...
متن کاملLatent topic model for audio retrieval
Latent topic model such as Latent Dirichlet Allocation (LDA) has been designed for text processing and has also demonstrated success in the task of audio related processing. The main idea behind LDA assumes that the words of each document arise from a mixture of topics, each of which is a multinomial distribution over the vocabulary. When applying the original LDA to process continuous data, th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014